This thesis designs a web information extraction system based on the semantic structure of a website. The system consists of three main components: a website spider, a website semantic structure generator, and a web information extractor.
This work studies and analyzes the characteristics and existing problems of how information is published and how its structure is organized in existing Blog and Semantic Blog systems.
Semantic structure may be viewed as an idealized information-theoretic model of belief.